Towards high performance continuous Mandarin digit string recognition

نویسندگان

  • Yonggang Deng
  • Taiyi Huang
  • Bo Xu
چکیده

In this paper, we address the problem of high performance speaker-independent continuous Mandarin digital string recognizer and focus on exploiting context information and prosody knowledge. Data-driven decision tree method to train tri-phone acoustic model was proposed. According to Chinese language property, digital specific question set was designed and the derived tri-phone model is more accurate to describe acoustic observation. For prosody cue, a novel Gaussian Mixture Density Duration Model (GMDDM) was presented. Unlike traditional normalizing or single parameter strategy, proposed duration model is context independent. The context variation is naturally embodied into multiple Gaussian distribution mixture. The number of mixture is automatically selected according maximum likelihood criteria. This simple but effective duration model’s likelihood score is combined with acoustic score as heuristic information for the backward A* decoding of word graph. Experimental results show the tri-phone acoustic model could lead to average 12.9% reduce of string error rate. When GMDDM model is applied, the string error rate is further reduced by 22.7%, which demonstrates the very usefulness of GMDDM model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition

Connected Mandarin digit string speech, especially at rapid spoken rate, is very difficult to recognize correctly. In this paper, a new training method named neighboring digits pattern is proposed in order to eliminate most of deletion errors which frequently occur in Mandarin digits speech recognition at high speaking rate when we have enough quickly-spoken speech data as the training set. The...

متن کامل

Duration Modeling in Mandarin Connected Digit Recognition

Digit string recognition is required in many applications which need to recognize numbers such as telephone numbers, credit card numbers, date, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there e...

متن کامل

High-Order Hidden Markov Model and Application to Continuous Mandarin Digit Recognition

The duration and spectral dynamics of speech signal are modeled as the duration highorder hidden Markov model (DHO-HMM). Both the state transition probability and output observation probabilities depend not only on the current state but also several previous states. Recursive formulas have been derived for the calculation of the log-likelihood score of optimal partial paths. The high-order stat...

متن کامل

Noisy Speech Recognition Performance of Discriminative HMMs

Discriminatively trained HMMs are investigated in both clean and noisy environments in this study. First, a recognition error is defined at different levels including string, word, phone and acoustics. A high resolution error measure in terms of minimum divergence (MD) is specifically proposed and investigated along with other error measures. Using two speaker-independent continuous digit datab...

متن کامل

Performance of Mandarin Connected Digit Recognizer with Word Duration Modeling

Digit string recognition is required in many applications such as automatic banking system, database information retrieving system, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there exist two mon...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000